A study on deep reinforcement learning-based crane scheduling model for uncertainty tasks
نویسندگان
چکیده
Abstract Aiming at the crane scheduling problem for uncertainty tasks in multi-crane situation, this article proposes a deep reinforcement learning-based modeling method that is not dependent on mathematical planning and has certain generality. First, process integrated into learning framework which orbit space of transportation task environmental information intelligent agent. Second, interactive mode between algorithm environment adjusted to adapt combined model. Last, model constructed by optimizing reward discount factor, rate, function intensive mode. Testing carried out based practical one steelmaking workshop. Scheduling proposal generated all are completed within planned time, verifies feasibility Results show compared with manual plan, new reduces total completion time 11.52%, collision routes decreases 57.14%, negative distance shortens 55.26%. The high efficiency therefore verified.
منابع مشابه
Operation Scheduling of MGs Based on Deep Reinforcement Learning Algorithm
: In this paper, the operation scheduling of Microgrids (MGs), including Distributed Energy Resources (DERs) and Energy Storage Systems (ESSs), is proposed using a Deep Reinforcement Learning (DRL) based approach. Due to the dynamic characteristic of the problem, it firstly is formulated as a Markov Decision Process (MDP). Next, Deep Deterministic Policy Gradient (DDPG) algorithm is presented t...
متن کاملA Study of Count-Based Exploration for Deep Reinforcement Learning
Count-based exploration algorithms are known to perform near-optimally when used in conjunction with tabular reinforcement learning (RL) methods for solving small discrete Markov decision processes (MDPs). It is generally thought that count-based methods cannot be applied in high-dimensional state spaces, since most states will only occur once. Recent deep RL exploration strategies are able to ...
متن کاملDeepCAS: A Deep Reinforcement Learning Algorithm for Control-Aware Scheduling
We consider networked control systems consisting of multiple independent closed-loop control subsystems, operating over a shared communication network. Such systems are ubiquitous in cyber-physical systems, Internet of Things, and large-scale industrial systems. In many large-scale settings, the size of the communication network is smaller than the size of the system. In consequence, scheduling...
متن کاملReproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control
Policy gradient methods in reinforcement learning have become increasingly prevalent for state-of-the-art performance in continuous control tasks. Novel methods typically benchmark against a few key algorithms such as deep deterministic policy gradients and trust region policy optimization. As such, it is important to present and use consistent baselines experiments. However, this can be diffic...
متن کاملDeep Episodic Value Iteration for Model-based Meta-Reinforcement Learning
We present a new deep meta reinforcement learner, which we call Deep Episodic Value Iteration (DEVI). DEVI uses a deep neural network to learn a similarity metric for a non-parametric model-based reinforcement learning algorithm. Our model is trained end-to-end via back-propagation. Despite being trained using the model-free Q-learning objective, we show that DEVI’s model-based internal structu...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: High Temperature Materials and Processes
سال: 2022
ISSN: ['0334-6455', '2191-0324']
DOI: https://doi.org/10.1515/htmp-2022-0040